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Abstract: We propose a theoretical model to describe the strain-induced 
linear electro-optic (Pockels) effect in centro-symmetric crystals. The gen¬ 
eral formulation is presented and the specific case of the strained silicon is 
investigated in detail because of its attractive properties for integrated optics. 
The outcome of this analysis is a linear relation between the second order 
susceptibility tensor and the strain gradient tensor, depending generically on 
fifteen coefficients. The proposed model greatly simplifies the description of 
the electro-optic effect in strained silicon waveguides, providing a powerful 
and effective tool for design and optimization of optical devices. 
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Nonlinear optics; (160.2100) Electro-optical materials; (000.3860) Mathematical methods in 
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1. Introduction 

In recent years, strain engineering is emerging as a new frontier in micro and nano-technology. 
By varying the elastic strain it is possible to turn on physical and chemical properties that 
are absent in the unstrained material. As a result, electronic, optical, magnetic, phononic and 
catalytic properties of a material can be tuned by compressive or tensile stress 0I3- 

In optics, tensile strained germanium and strained silicon are attracting a great deal of in¬ 
terest. Tensile strained germanium-on-silicon can be used as active material for the short- 
wavelength infrared light and it can be an efficient solution for manufacturing monolithic lasers 
and optical amplifiers @ 0 - On the other hand, Pockels effect has been experimentally meas- 





ured in strained silicon making it a promising candidate material for realizing very fast in¬ 
tegrated optical modulators and switches. The modeling of strain-induced electro-optic Pockels 
effect in silicon is the main object of this work. 

The electro-optic effect consists in the change of the refractive index induced by an electric 
field that varies slowly compared with the frequency of an optical signal |[^[7|. In the particular 
case of the Pockels effect, also called linear electro-optic effect, the change in the refractive 
index is proportional to the applied electric field, providing an efficient physical mechanism 
for optical modulation. However, a peculiarity of the Pockels effect is that it arises only in 
crystalline solids lacking of inversion symmetry As a consequence, for centro-symmetric 
crystals (like silicon) the Pockels effect can be observed only when the inversion symmetry is 
broken, e.g., by the presence of significant surface/interface effects IMD or by an inhomoge¬ 
neous mechanical stress. 

Since the seminal work there have been considerable progresses in the fabrication of 
electro-optic modulators based on strained silicon. In 2011 the first fully integrated Mach- 
Zehnder interferometer (MZI), based on strained silicon rib waveguides, was manufactured 
and the value of 122 pmA^ for the effective electro-optic susceptibility was measured (H Two 
years later, the same authors presented a detailed investigation of the local strain distribution and 
of the induced optical nonlinearity as a function of the waveguide width, measuring the record 
value of 190pm/V for a 300 nm large rib waveguide GD- The dependence of the second order 
dielectric susceptibility on both wavelength and waveguide width was later investigated for a 
channel waveguide, showing that higher values of the effective susceptibility can be reached 
for narrow waveguides and large waveleng th |T^ . 

The more recent analysis performed in ||15| and suggests that the phase shift observed 
in previous MZI experiments (and thus the corresponding effective index variation) can be also 
related to the presence of free carrier variation inside the waveguide. The inversion of the phase 
shift observed when switching the applied tension, which was previously interpreted as the 
smoking-gun signal for the Pockels effects, is now attributed also to the surface charge present 
in the silicon nitride cladding. The results presented in dz) where the free-carrier plasma 
dispersion effect in silicon waveguides has been theoretically characterized, further support the 
possibility that the results obtained in |T2p^ and |T^ suffer from contamination of free-carrier 
contribution. 

The most natural way to avoid contamination from free-carriers would be to perform high¬ 
speed measurements, with temporal resolution smaller than the free-carrier response times. In 
particular, an unambiguous indication for the presence of a strain-induced second order sus¬ 
ceptibility is given by second harmonic generation (SHG) measurements. The experimental 
value of the second order dielectric susceptibility extracted from SHG measurements is of 
40pmA^ p^ , which is expected to be of the same order of magnitude of the one related to 
the Pockels effects. The comparison of these two effects can however be only qualitative, since 
they are associated with different frequency components of the nonlinear susceptibility. 

On the one hand, when the free-carrier concentration are known, the corresponding effec¬ 
tive index variation can be predicted by the empirical formula proposed by Soref and Bennett 
in 1987 p^ , and latterly improved by Nedeljkovic et al. pQ| . On the other hand several ap¬ 
proaches have been proposed to model the strain-induced Pockels effect in silicon, however no 
one can be effectively used for practical purpose. 

In it is shown that a simplified classical model of a 2D centro-symmetric lattice is not 
able to reproduce the correct order of magnitude of the experimentally measured susceptibili¬ 
ties, moreover, in this approach, the way in which the strain enters the computations does not 
appear to be completely justified. A more sophisticated model is used in p8| , where the linear 
electro-optic effect in strained silicon is studied by using the time-dependent density-functional 




theory ||^[^. While this ab initio method is theoretically well founded, it has the obvious 
drawback of being computationally very expensive. 

Some intuitive ideas already present in the literature relate the effective susceptibility to the 
strain gradient (see, e.g., |[^[T^[^[^). Indeed, as explicitly noted in d): “Despite the lack 
of general proof for this claim available in the literature yet, it has been widely accepted that the 
second order nonlinear effects in strained silicon are caused by the variations of strain i.e. strain 
gradients inside the crystal.” In this work we will show that such a relation can be deduced by 
using just symmetry arguments and the specific case of the strained silicon will be investigated 
as a particularly interesting example. The final result will be a simple linear relation between 
the second order effective susceptibility tensor and the strain gradient tensor (weighted by the 
electromagnetic modes), depending generically on fifteen independent coefficients that can in 
principle be obtained from experimental measurements. Once these coefficients are known, the 
computation of the electro-optic effect is reduced to a standard strain computation and elec¬ 
tromagnetic mode analysis, thus providing an easy framework for the optimization of optical 
devices. 

2. Nonlinear susceptibility and Pockels effect 

In this section we recall, for the benefit of the reader and to fix the notation, the basic properties 
of the quadratic nonlinear susceptibility that will be needed in the following. 

The quadratic nonlinear susceptibility is conventionally defined, for a local and causal 
medium, by the following relation between the (quadratic component of the) polarization vector 
{t) and the electric field P(r): 

oo 

pP{t) = eo jjx^^l{T\,T:2)Ej{t-X])Ek{t-X2)AxxAx2 , (1) 

0 

(2) 

where eo is the free-space dielectric permittivity, xljk i^ susceptibility tensor. Sum over 
repeated indices is always understood in Eq. 0 and followings unless otherwise explicitly 
stated. The dependence on the position x is omitted for the sake of the simplicity but will be 
important in the following sections. The previous relation can be rewritten in the frequency 
domain as following 

+ (O2) = eoXiJki(^i+(02;(Ou(02)Ej{o}i)Ek{(02) , ( 2 ) 

where the first argument of the susceptibility is just the sum of the two other frequencies, a 
notation conventionally adopted in the literature. 

( 2 ) 

The symmetry properties of the tensor Xijl {C0\CO 2 ', C0\, CO 2 ) will be particularly important 
in our analysis. From the definition and the reality of the fields it easily follows that 

xljl {(0l + C02-,C0i,(02)= xlkj {(0l + C02-,C02,C0i ) (3) 

and 

xlJk{a)i + (02-,a)i,co2)=XiJk{-o>i-a)2-,-cou-co2)*. (4) 

For a lossless and weakly dispersive medium, when no external static magnetic fields are 
present, it can be shown that Xijl CO 2 ; COi ^ CO 2 ) is real and it is invariant under any per¬ 
mutation of the indices, provided the arguments are similarly permuted (see e.g. flSl for an 
explicit computation in perturbation theory or pb} for an indirect argument). In particular, in 
the limit of zero frequencies it is symmetric under a generic permutation of the indices. 


The Pockels effect consists in the variation of the index of refraction at frequency CO when a 
static electric field is applied. To investigate the Pockels effect we thus have to study P^^\(o) 
when the electric field is E{t) = where we denoted by E^^ the static 

(real) electric field and by E^^\(o) the component of frequency (0 of the optical field. The 

('2') 

relevant frequency components of the quadratic susceptibility are thus (to; (0,0) and the 
general symmetries previously discussed become now 

xjjl (ffl; ffl, 0) = x\l] («;0, ft)), 

xfk = X^k{-0);-(0,0) = x% (®;W,0), 


('2') 

thus to, 0) is invariant under permutation of the first two indices. Using these properties 

it is simple to obtain the following form for the polarization in the case of the Pockels effect 

pf-\co) = (a);co,0)£f^(co)£f . (6) 

For the sake of simplicity, we will suppress in the following the superscript of the optical field 
E^^^ that will be simply denoted by E. The phenomenon of second harmonic generation can 

also be described using similar relations, the main difference being that the relevant frequency 

( 2 ) 

components of the second order susceptibility are in that case (2(0; m, m), wich are sym¬ 
metric under the exchange of the last two indices. 

Note that, in addition to the previously discussed symmetries of the quadratic susceptibility, 
the lattice symmetry must also be added in the case of crystals: the tensor is invariant under 
the symmetry group of the lattice. In particular, for centro-symmetric lattices (that are invariant 
under the inversion symmetry x — —x) the tensor has to vanish, like all the invariant 
tensors with an odd number of indices. 

In the following, when discussing lattice symmetries, it will obviously be convenient to work 
in the crystallographic frame, however attention has to be paid to the fact that this frame typi¬ 
cally does not coincide with the device coordinate one, so that a change of reference frame is 
needed to obtain expressions of direct physical application p7| . 


3. The strain-induced Pockels effect 


In the linear theory of elasticity, a small deformation x — 
strain tensor £, defined by 

duj duj 
dxi 


dXi 


'X-\-u{x) is described by the symmetric 


(7) 


where u{x) represents the displacement of a material point |28|. In order to determine the re 


lation between X^^"^ ^nd £ we will follow the same philosophy of effective field theories in 
theoretical physics: the most general expression compatible with the symmetries of the prob¬ 
lem is considered, then the various possible terms are classified according to their “strength” 
retaining only the most relevant ones. The final relation will depend on a number of unknown 
constants, which are usually called “low energy constants” and have to be fixed by comparing 
with experimental data. 

As a starting point we assume that is a local functional of £, i.e. that X^^"^ ^1 point x 
depends only on the values of £ and its derivatives at x (the possible dependence of observable 
quantities on the strain gradient tensor is well known in the litature, see e.g. ID). We then 
assume that this dependence is analytic and we develop everything in Taylor series, thus arriving 



to the following expression 


,( 2 )_^( 2 ), . 


Xijk —Xijk\e=0 + 


ijk 


de, 


ap 


%/3 + 
e=0 


dxS 




Ca(5Y~^ 


dh-f 


ijk 


Cydfi 


^a(5 ^Ydji ' 


e=0 

dhS 




£=0 




e=0 


^ aj 5 ^75 
£=0 


( 8 ) 




where we introduced the shorthand 

^ ( 9 ) 

and where dots stand for terms involving higher derivatives of the strain tensor and higher orders 
of the Taylor expansion. The subscript |e=o means that the derivatives have to be computed at 
vanishing deformation. 

If we now specialize to the case of centro-symmetric crystals, all the coefficients with an odd 
number of indices identically vanish, thus the previous expression becomes 


^y(2) n2y(2) 

(2) ^ ^Xjjk . ^ ^iJk 

d^aPr 

£=0 

In particular all the terms depending on e but not on its derivatives disappear. This is a direct 
consequence of the fact that a uniform strain does not break the inversion symmetry and thus 
cannot induce a non-vanishing quadratic susceptibility. 

The various term in the right hand side of Eq. ( p^ can be classified according to their power 
of the strain and their number of derivatives. In the limit of small deformation and (by writing 
explicitly the dependence on the frequencies and on the position) the leading contribution is 

xjjl (jc; ffli + (O2; ffli, (O2) = Tijkapyicoi + (O2; COi, (O2) Ca/ 3 rW ( 11 ) 

and we thus expect a linear relation between the tensors and 

The tensor T inherits some symmetries from and £: it is symmetric for a ^ j8 and, as 
far as the Pockels effect is concerned, for i ^ j (see Eqs. ^). It is thus useful to adopt the 
contracted index notation f^ij]k{a^}Y ^PP* |^’ points out the fact that only 324 of 
the 3^ = 729 components of T are linearly independent. Since T is an invariant tensor for the 
lattice symmetry, this number can be further largely reduced. In the case of the lattice octahedral 
symmetry typical of the silicon crystal j^, the number of independent components is in fact 
only 15. The general procedure to identify the independent elements is reported in App. 
while the explicit form of Eq. ( pT] ) in the crystal reference frame is given in App.[^ 


CySii ' 


£=0 


( 10 ) 


4. The effective susceptibility 

We have shown in the previous section that a linear relation between the tensor and 

the strain gradient ^ {x) has to be expected, however this local relation is not easily accessible 
by experiments. What is typically measured (see p2}|T^ ) is the variation of the effective re¬ 
fractive index for a propagating waveguide mode induced by the switching-on of the static 
electric field. The change of the effective refractive index due to the Pockels effect can easily 
be obtained (see App. D) and it is given by 

^ ®cL4 , 


( 12 ) 

















where the spatial dependence of the electric fields and of is implied, A is the cross-section 
of the silicon waveguide and the normalization factor N is the active power of the optical mode 
that propagates along the waveguide, given by 

N=\f {ExH*+E* xH)-i^dA, (13) 

2 Jaoo 

where 4 denotes the unit vector parallel to the direction of propagation and Aoo is the whole 
plane orthogonal to the waveguide. In a typical experimental setup, the metal contacts are 
much larger that the optical waveguide and the electrostatic field can be assumed constant in 
Eq. ^ |[13{T4|- 

In order to compare different experimental results, obtained using different devices, it is 
convenient to introduce an effective susceptibility that relates to . It is worth noting 
that, since is a scalar quantity, an effective susceptibility defined in such a way will not 
be a 3—index tensor like but simply a vector. Another aspect that is usually not fully 
appreciated is the fact that the effective susceptibility not only provides a quantitative estimate 
of the non-vanishing properties of but it is also related to the effectiveness of the device 
in maximizing 

There are several definition of the effective susceptibility that are used in the literature and 
indeed, for this reason, some care is required in comparing results from different experimental 
groups (like, e.g., |[^[T^ and fT4]|). The simplest definition is just given by 

xfEi^ = rf^Arf^, (14) 

where the index k is the direction of the static field, like in Eq. ( p^ . When only one component 
of the static field is different from zero, it is straightforward to derive the corresponding entry 
of the effective susceptibility vector. The definition in Eq. ( p^ appears to be the natural gener¬ 
alization to anisotropic optical waveguides of the simple result nAn = which is valid 

for the Pockels effect in homogeneous and isotropic materials, and it follows from = ejeQ 
and e = eo(l 

Using in this definition of effective susceptibility the relation of Eq. O leads to 

Xf{(0\ ffl,0) = Tijka^y{(x)\ ffl,0) Ci3y(®) > (15) 

where we defined the weighted strain gradient as 

Ci3y(®) = J^E*{x)i^apy{x)Ej{x)dA . (16) 

It can be noted that the frequency dependence of ^^0) originates from several sources: 

the possible explicit dependence of T(to; (0,0) on to, the presence of in the definition of the 
weighted strain gradients and, implicitly, through the form of the optical waveguide mode E (x) 
in Eq. ( p^ . 

Since the experimentally observed dependence of x^^ on the frequency is very mild (see 
p^), it is reasonable to assume that the tensor T is frequency independent, i.e. that all the 
dependence of xf^ on (O can be explained by the dependence of the weighted strain gradient 
on (0. It should be clear that this is not expected to be always true and this assumption can only 
be justified/disproved by its ability/inability to reproduce experimental data. When new, more 
precise, data will appear, it is conceivable that this hypothesis will have to be relaxed. 

5. Investigation of strain-induced susceptibility in fabricated devices 

In this section we investigate two waveguide fabricated and characterized in and (E), 

respectively. The 15 independent entries of tensor T can be potentially computed by fitting the 





experimental results. However, because the results in the literature might be contaminated by 
the free carrier effect |[^[^, we limit our analysis to some general properties of the strain- 
induced showing the role played by the geometry to provide different effective second 
order susceptibility. 

5.7. Devices under investigation 
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Figure 1. Cross-section of the strained silicon based MZI studied in fT^fTS) and (E). 
respectively. In (a) the slab waveguide cross-section described in (El |13| . The cases of 
waveguide width (wsi) equal to 300 nm, 350 nm, 400 nm, 450 nm, and 500 nm have been 
investigated. In (b) the channel waveguide cross-section described in (E) The cases of 
waveguide width (wsi) equal to 385 nm, 435 nm, and 468 nm have been investigated. Pic¬ 
tures are not to scale. 


The waveguide cross-section used in |[^[^ is schematically shown in Fig. |l(a)| A silicon 
rib waveguide was manufactured on a silicon-on-insulator (SOI) substrate with a 220 nm thick 
(lOO)-oriented top silicon layer over a 3 jim thick buried oxide. Silicon waveguides are fabri¬ 
cated by an etching process that leaves a slab thickness of 45 nm. A 350 nm thick Si 3 N 4 layer 
is deposited using remote plasma enhanced chemical vapor deposition and after the annealing 
process, a protective Si02 cladding layer is deposited on the top (850 nm thick). 

The geometry adopted in (E) is slightly different and it is schematically shown in Fig 


1(b) 


A fully etched channel waveguide is fabricated on a 260 nm thick (lOO)-oriented top sili¬ 
con layer with 2 jim buried oxide. The silicon waveguide is eventually covered by a single 
700 nm layer of Si 3 N 4 . This choice was motivated by the fact that the protective Si02 cladding 
layer on Si 3 N 4 was observed to reduce the overall stress in the silicon wafer and the induced 
nonlinearity. 


5.2. Strain simulation details 

In order to investigate the consequences of Eqs. ( p?] ) and ( p^ , we need to evaluate the strain 
gradient for the structures described in the previous section. Since silicon elastic properties 
significantly depend on the orientation of the crystalline structure, they have been taken into 
account in the mechanical simulation. To describe the deformation of silicon, one possibility 
could be to make use of the Hooke’s law, i.e. the linear relation between strain and stress, 
which for materials with cubic symmetry involves only three independent components pQ| . 
A more convenient description, that avoids tensorial transformation, is however the one that 
















makes use of the orthotropic model p0| . A material is said to be orthotropic when it has at 
least two orthogonal planes of symmetry. Its elasticity can be described by a matrix that takes 
into account the fundamental elasticity quantities in the axes of interest: the Young’s module 
(7), the Poisson’s ratio (v) and the shear modulus (G). In this work we use the letter Y for the 
Young’s module instead of E, traditionally used, to avoid confusion with the electric field. 

The most common use of orthotropic expressions for silicon is to provide the elasticity values 
in the frame of a standard (lOO)-silicon wafer. When z = [110], v = [IlO], y = [001], like in the 
device investigated in (T4) the elasticity moduli are pQ| 

F;, = 169GPa F^ = 130GPa F^ = F;, 

= 0.36 V3;^ = 0.28 = 0.064 (17) 

Gjcy = 79.6 GPa Gy, = G^y G^,, = 50.9 GPa. 

Similar relation can be derived for the device in W , where v and z are switched in Eq. ( p^ . 

The deformation of the silicon waveguides has been computed using COMSOL multi¬ 
physics tools pT| . Assuming 1 GPa compressive stress as the initial condition for the silicon 
nitride layer, the elastic strain has been computed for the structures shown in Fig |l(a)| and 
Fig |l(b)| as a function of the waveguide width and the wavelength. For the silicon waveguide 
and the buried silicon we used the values of the elastic modulus, the shear modulus and the 
Young’s modulus in Eqs. ( p^ . Since for the solid analysis a 2D model has been considered, the 
components Ex, and £y, of the strain identically vanish; for the same reason, the derivative of 
the strain coefficients with respect to z are assumed equal to zero. 



Figure 2. Strain profile of silicon waveguide used in Chmielak et al. |T^ : (a) ^XX-> (b) Eyy, 
(c) (d) ^xy- 
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Figure 3. Electric field components in the case of Chmielak et al. GD 



Figure 4. Strain profile of silicon waveguide used in Damas et al. 0: (^) ^XX-^ (b) Cyy, (c) 

^ZZ'' (^) ^XJ’ 


5.3. Results 

In Fig. we show the behavior of the strain components from which we can evaluate the 
strain derivatives, in Fig. the results of the electromagnetic mode analysis, performed using 
COMSOL multi-physics tools (ID The geometry taken into account is the waveguide cross 
section for the device described in m The same results for the device described in (E) are 
respectively shown in Figs. andFocusing on the mode analysis, it is worth noting that in 
both cases the waveguides show a single mode behavior, however the Ey and components 
are not negligible compared to the E^ component and thus the mode is not purely transverse 
electric. The high value of E^ is due to the high index step of the waveguides which requires in 
fact a full vectorial mode solver to be accurately computed 
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Figure 5. Electric field components in the case of Damas et al. (E) for an incident wave¬ 
length of 1550 nm. 


Working in the device coordinate frame reported in Figs. 1(a) and 1(b) 


and with directed 

along the y-Sixis, the relation between the effective susceptibility and the strain gradient can be 
written in the form 

'^{0))=CiOi{0)) , (18) 


yeff/ 

Ay 


where the coefficients (/ = 1,..., 15) are the independent entries of the tensor T and the terms 
Oi are linear combinations of the weighted strain gradients that will be called overlap functions 
in the following. Their explicit form is reported in AppJH Eq. ( [3T] ) and it is the same for both 
the crystallographic axis orientation considered in ms and 0 . For these orientations some 
of the overlap factors turn out to coincide: 


03 p 7 = 03 = 0 j, d>5p6 = d>5 = (96, Oupll = On = On, (19) 

SO that only 12 independent coefficients are needed in this case, instead of the 15 ones required 
for a generic device orientation. 

It is reasonable to expect that, for practical purposes, the number of numerical constants to 
be fixed can be further reduced. Indeed we numerically estimated the overlap functions cor¬ 
responding to the experimental setups used in 0 and 0 and observed that in both cases a 
clear hierarchy can be observed: the values of 02 , osp^, og, oio and onpii are significantly larger 
than the others, with marginal contributions from ou and 6 ^ 15 ; all the other overlaps are smaller 
by an order of magnitude or more. In Figs.j^and Fig.[7]these most significants overlap are dis¬ 
played for various waveguide width and wavelength. Since all the symmetries of the problem 
have been taken into account when determining the independent components of the tensor T, 
it is natural to expect all the Ci to be about the same order of magnitude. As a consequence 
one expects that the coefficients that will be more important to reliably describe experimental 
results are the ones multiplying the dominant overlaps. This reduces the number of independent 
constants to be determined to 7 in our more conservative estimate. 

From Figs. 1^ andwe see that the dominant contributions are the ones denoted by og, osp^ 
and 6 > 2 . It is thus interesting to look at the specific form of these overlaps in Eq. ( [ST] ). The 
largest field component is and it is natural to expect that these large overlaps are weighted 
by Ejc, which is indeed the case. Regarding the strain-gradient components, o^pe and og involve 
d£xx/dy, which has been often assumed in the literature as the only relevant component of the 
strain-gradient; however 02 is independent of this component and is related to dSyy/dy, whose 
importance has been overlooked so far. 

As explained in the introduction, we cannot at present estimate the numerical values of the 
constants Ci in Eq. ( p^ for lack of reliable (i.e. for which we can safely exclude a contamination 
from free carriers effects) experimental data to compare with. It is however interesting to note 
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Figure 6. Behavior of the overlap functions for the waveguides under investigation with 
respect to waveguide width for (a) the device used by Chmielak et al. O and (b) the 
device used by Damas et al. El- In both cases only the most significant overlaps are 
plotted at A = 1550™. 




Figure 7. Behavior of the overlap functions for (a) the waveguide used in Chmielak et 
al.E) and (b) the waveguide used in Damas et al. E) with respect to the wavelength for 
a waveguide width wsi = 3^5 nm. Only the most significant overlaps are plotted. 


that, also without any knowledge of these constants, we can reach an important conclusion on 
a point that attracted some attention in the literature, i.e. the relevance of fabrication defects on 
the strain-induced Pockels effect. In particular in |T3| was noted that the strain profile of the 
studied structure was quite sensitive to the presence of lateral cracks in the Si 3 N 4 overlayer. 
When such defects are present, we observe a strong increase of both strain components and 
strain gradients in proximity of the lateral cracks, as shown in Fig. However, as can be seen 
from Fig.|^ the defects do not induce any sizable modification of the overlap functions. 

The physical explanation for this result is that the device regions in which the strain is sig¬ 
nificantly affected by the defect are also the regions where the electromagnetic field is weak, so 
that the net effect of the crack on the quantities in Eq. ( p^ is small. This simple fact was previ¬ 
ously overlooked, probably due a confusion that is sometimes present in the literature between 
the strain-gradient dependence of the local non-linear susceptibility in Eq. (U) and that of the 
effective susceptibility, in which the strain-gradient tensor enters weighted by the electromag¬ 
netic field. Having explicitly checked that the overlaps does not appreciably changes when the 
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Figure 8. Strain components profile £xx (a), %(b), %(c) and ex};(d) in the strained silicon 
waveguide (H) in presence of defect fabrication in the Si 3 N 4 slab. 
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Figure 9. Comparison between the overlap factors for the case of Chmielak et al. GD with 
and without fabrication defect. 


lateral crack is present, we can thus safely conclude (also without any knowledge of the numer¬ 
ical coefficient Ci) that the dependence of the effective susceptibility on this type of fabrication 
defects is very weak, as far as the strain-induced Pockels effect is concerned. 

6. Conclusions 

We have proposed an effective model that describes the strain-induced dielectric susceptibil¬ 
ity in centro-symmetric crystals starting from symmetry arguments. The specific case of the 
Pockels effect in strained silicon has been investigated as a particularly interesting example. 










































with potential applications in silicon photonics for implementing modulation and switching 
functions, however the idea of the approach is extemely general and can be applied to any 
centro-symmetric crystals (in fact also to non centro-symmetric ones) and to other physical 
phenomena, like e.g. second harmonic generation. 

The main result of our analysis consists in a simple relation between the second order di¬ 
electric susceptibility and the strain gradient tensor. We have shown that the effective second 
order susceptibility of strained silicon cannot be accurately described by considering only me¬ 
chanical deformations but the combination of optical modes and mechanical stress analysis is 
required. As a result the effective susceptibility can be written as a linear combination of the 
weighted strain gradient components (defined in Eq. ( p^ ) with fifteen independent coefficients 
in the general case. This approach put on firm ground some intuitive ideas present in the recent 
literature, that suggested a relation between the strain-induced effective susceptibility and the 
strain gradient. 

In the experimental settings studied in and 0 the number of independent coeffi¬ 

cients to be determined to completely parametrize the strain-induced Pockels effect reduces to 
12 , moreover we gave indications that, for all practical purposes, this number can be effectively 
reduced to 7. 

When new experimental measures of the effective strain-induced susceptibility (free from 
spurious effects like the ones related to free carriers) will be available, it will be possible to 
estimate these coefficients.This will have a considerable impact on the design and optimiza¬ 
tion of electro-optic modulators based on strained silicon, since it reduces the computation 
of electro-optic effect to standard mechanical and optical computations, thus allowing device 
optimization in terms of silicon waveguide geometries, crystallographic axes orientation and 
electrode position. 


A. Contracted index notation 

When dealing with tensors which are invariant under permutation of some indices it is conve¬ 
nient to introduce a compact notation, in order to simplify expressions. It is customary to define 
the contracted index notation (denoted by curly brackets) as follow 


{11} ^1, {22} —> 2, {33} ^ 3, 

{23},{32}->4, {13},{31} ^5, {12},{21} ^ 6 . 


( 20 ) 


Using this contracted notation a symmetric 2—index tensor V (i.e., Vij = Vji) can be written as 
a vector V with 6 entries, whose components are with i < j, and analogous simplifications 
occurs also for higher order tensors. It is important to note that, while the index contraction 
make some manipulations easier, the tensor transformation and multiplication properties are 
changed by this index replacement and some care is required. 

In contracted notation, the 6 -index tensor T tensor introduced in Eq. 0 can be written (for 
the case of the Pockels effect, see the end of Sec.|^ as the the four index tensor 


where j-indices take value in 1 , 2 ,3, while hj 4 = 1 , 2 ,3, and /i, 4 = 1 ,..., 6 . 


( 21 ) 


B. Symmetry analysis of the tensor T 

In this appendix we report the algorithm used to write down the independent components of 
the 6 —index tensor T introduced in Eq. ( pT| ). It is first of all convenient to rewrite the tensor T 
as a column vector by introducing a lexicographic labelling of the indices. The component 




Ti^h-ie (^ 1 : • • • 5 ^6 = 1 : 2 ,3) corresponds to the component T[ where 

i = 3^ ^ (^5 — 1) +... + 3 (^2 ~ 1) + • 


(22) 


In order to impose the invariance properties of T, the general transformation law for ten¬ 
sors have to be translated in a form suitable to be applied to the vector . If R is the matrix 
representing the coordinate transformation, let us define the square matrix A as 




(23) 


where the relation between i and /i,..., /6 is the same as Eq. ^22\ and analogously for the js. The 
invariance of the tensor T under lattice symmetries is thus equivalent to the relation = AT^, 
moreover all the entries of A are integers in the crystallographic base 

In the case 5 - symmetry operations are present, with associated matrices A^^\ the vector 
has to satisfy the linear system 




(24) 


where I is the identity matrix. 

To identify the independent elements of the tensor T, symmetries under index permutation 
must also be taken into account. For this purpose, let us introduce the column vector T asso¬ 
ciated to the contracted tensor T, defined by tf = where / 2,4 = 1,2,3, /i, /s = 1,..., 6 

and 

/ = 6 • 3 • 6 • (4 — 1) + 6 • 3 • (^3 — 1) + 6 • (/2 — 1) + 4 • (25) 

The relation between f and can be written in matrix form as 


Tc ^ 


(26) 


where C is a 729 x 324 matrix, whose rows have only a single element different from zero and 
equal to 1. Combining Eq. ( [24| ) and Eq. ( |^ we arrive to 


/C-A(')C\ 

[c-A^^^cJ 


T = NT 


(27) 


where iV is a rectangular matrix. Because N is real, rank(IV) = rank(IV^iV), so we can multiply 
Eq. ( [Z7] ) by preserving the rank and reducing the number of equations. Reducing the system 
in the echelon form by applying the Gauss algorithm p4| , we finally obtain 

(0 0) (%)=». 

where A is the permutation matrix of the Gauss algorithm. In the previous equation, we split 
the vector T into two blocks: T is the vector of the independent components while T = 
is the vector that contains the dependent entries of f. The number of dependent entries 
of T is thus given by the rank of N and the independent elements of tensor T span the kernel 
of A. 





C. Explicit form of some relations for the octahedral lattice 

By applying the algorithm presented in Section we derived the explicit form of the local 
relation Eq. O between and ^ that has to be applied in the case of silicon. This explicit 
form is frame dependent and the expressions in Eq. ( [29] ) are written with respect to the crystal 
axis X = [100], y = [010], z = [001]. 

^111 = +C4(^221 +qC331 + 2 ci 3 ^122 + 2 ci 3 ^ 133 ; 

Xni = 1^121 +<35^112 +C2C222 + ^^6^332 + 2C12C233; 

Xm = 2cii^i3i + 2 C 12 C 232 + Cl 13 +<36^223 +<32^333; 

^221 = ^^2Clll +<35C221 +^^6C331 + 2 cii Ci22 + 2ci2Ci33 ; 

^222 = 2ci3Ci21 + <34Ci12 + ^^1 C222 + ^^4C332 + 2ci3C233; 

X 22 ?> = 2C12C13I +2C11C232 + ^^6Cll3 +<35C223 +<32C333; 

^331 = <32Ci 11 +<36C221 +^^5^331 + 2ci2Cl22 + 2cii Cl33; 

^332 = 2 C 12 C 12 I +<36Ci12 +^^2C222 + <35C332 + 2 C 11 C 233 ; 

^333 = 2 ci3 Cl31 + 2(313 C232 + ^^4 Cl 13 + Q C 223 + <3i C333 ; 

;^231 = 2C9C23I+2cioCi32 + 2cioCi23; 

^232 = 2ci4Ci31 + 2ci5C232 + oCi13 + <38C223 + <33C333; 

^233 = 2ci 4 C12I + <37 Cl 12 + C '^ C222 + ^^8 C332 + 2(3i5 C233; 

^131 = 2ci5Ci31 +2C14C232 + ^^8Ci13 +<37C223 +<33C333; 

;^132 = 2C10C23I +2c9Ci32 + 2cioCi23; 

^133 = ^^3Clll +<37C221 +^^8C331 + 2 ci4Ci22 + 2ci5Ci33; 

^121 = 2ci 5 C12I + <38 Cl 12 + C222 + O C332 + 2 ci4 C232; 

;^^122 = ^^3Clll +<38C221 +oC331 + 2ci5 Ci22 + 2ci4Ci33 ; 

^123 = 2 C 10 C 231 +2CloCl32 + 2c9Cl23• 

The coefficients Ci that appear in this expression are related to the independent components of 
T by the relations 

^^1=^3333, <32 = 72333, C 3 = 14233, Q = 73323, C 3 = 12313, 

<36 = 7 i 323, <37 = 75123, <38 = 74223, ^^9 = 76363, <310 = 75263, (30) 

^^11= 73153, ^^12 = 72153, ^^13 = 7ii53, C 14 = 76253, ^^15 = 75353- 

The explicit form of the overlap functions to be used in Eq. ( [T^ in the main text are here 
reported with respect to the coordinate frames used in |[^[^ and (T4) (the weighted strain 
was defined in Eq. ([T^): 


oi = , 

03 = Re{S + ^} , 

^5=2 ^xxy + 2 ~*~ 2 ~*~ 2 ’ 

Ol = < 93 , 

^ \ ^_T^_T^ 

^9 — hxxy I hzzy hxxy ^zzy ’ 

<^11 = ~ 5 

013 = 2^ , 
oi5=4Re{^}. 


— l^yyy + Cyyy , 

^4 — I bzzy , 


^6 = ^5, 

08=2Me{^} , 

^10 = ^^xyx ~ , 

On = ^11 , 

0i4 = 2Re{C-'^} , 


D. The variation of the refraction index 


( 31 ) 


In this appendix we report some details on the expression for the variation of the effective 
refraction index induced by a non-vanishing X^'^\ following an argument analogous to the one 
used in Refs. | [35l . 

Let us consider the following form of the reciprocity theorem for a waveguide (see e.g. p6| 
§31-1): 


- [ 

dzJAoo 


Fc •4dA 



VF,dA, 


(32) 


where the waveguide is directed along the z direction, Aoo is the plane orthogonal to it and the 
vector Fc is defined by 

Fc=ElxH^ExHl . (33) 

In the following Eq and Hq will be the optical fields propagating in the guide when = 
0, while E and H will be the corresponding fields when the nonlinear susceptibility is non¬ 
vanishing. It is simple to show that, when all the field have angular frequency (0, Eq. ( [32| ) can 
be rewritten in the form 


dz 



xH + Ey.m)-i^dA = m f E*n-P^^'>(iA , 

Ja 


(34) 


where is the polarization induced by the nonlinear susceptibility (P = Pq + and A is 
the section of the wave guide in which X^^^ is non-vanishing. 

We will use for the unperturbed fields the form 


Ho{r,t)=ho{x,y,(Oo)e‘^'^^-^‘K 


while we will assume for the perturbed fields the expressions 

E{r,t) = u(z)eo(x,y;o)o)e‘^'‘^~‘^‘^ , 

H{r,t) = uiz)ho{x,y,(Oo)e‘^'^^-^‘K 


thus assuming that the nonlinearity does not significantly affect the transverse modes. 

Using these forms of the fields in Eq. ( [34| ), together with Eq. ([^, which can be written as 

= 2£oX^^\o)-,o),0) : EE‘‘‘^, 


(37) 


we obtain a differential equation for the envelope function u{z), namely: 



( 38 ) 


where we introduced the notation 


2£oJ^e^-X^^'>:eoE‘^^dA 


(39) 


{eo xhQ + x ho) ■ 4 dA ' 


By solving this equation and inserting the solution in Eq. ( [3^ we finally obtain k = ko-\- (oX, 
thus corresponding to the variation = cX of the effective refraction index. 
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